Identifying Inliers
نویسندگان
چکیده
The problem of outliers is well-known in statistics: an outlier is a value that is far from the general distribution of the other observed values, and can often perturb the results of a statistical analysis. Various procedures exist for identifying outliers, in case they need to receive special treatment, which in some cases can be exclusion from consideration. An inlier, by contrast, is an observation lying within the general distribution of other observed values, generally does not perturb the results but is nevertheless non-conforming and unusual. For single variables, an inlier is practically impossible to identify, but in the multivariate case, thanks to interrelationships between variables, values can be identified that are observed to be more central in a distribution but would be expected, based on the other information in the data matrix, to be more outlying. We propose an approach to identify inliers in a data matrix, based on the singular value decomposition. An application is presented using a table of economic indicators for the 27 member countries of the European Union in 2011, where inlying values are identified for some countries such as Estonia and Luxembourg.
منابع مشابه
Globally Optimal Inlier Set Maximization with Unknown Rotation and Focal Length
Identifying inliers and outliers among data is a fundamental problem for model estimation. This paper considers models composed of rotation and focal length, which typically occurs in the context of panoramic imaging. An efficient approach consists in computing the underlying model such that the number of inliers is maximized. The most popular tool for inlier set maximization must be RANSAC and...
متن کاملInliers Detection Using Schwartz Information Criterion
In failure time distributions, inliers in a data set are subset of observations sufficiently small relative to the rest of the observations, which appears to be inconsistent with the remaining data set. They are either the resultant of instantaneous failures or early failures, experienced in many life-testing experiments. The model used in outliers, where r observations are outliers is modified...
متن کاملRobust Affine Motion Estimation in Joint Image Space Using Tensor Voting
Robustness of parameter estimation relies on discriminating inliers from outliers within the set of correspondences. In this paper, we present a method using tensor voting to eliminate outliers and estimating affine transformation parameters directly from covariance matrix of selected inliers without additional parameter estimation processing. Our approach is based on the representation of the ...
متن کاملConcurrent Tracking of Inliers and Outliers
In object tracking, outlier is one of primary factors which degrade performance of image-based tracking algorithms. In this respect, therefore, most of the existing methods simply discard detected outliers and pay little or no attention to employing them as an important source of information for motion estimation. We consider outliers as important as inliers for object tracking and propose a mo...
متن کاملAn Efficient Dictionary Based Robust PCA via Sketching
In this paper, we examine the problem of locating outliers from a large number of inliers with the particular interest when the outliers have known basis. By a convex formulation of demixing, we provide provable guarantees for exact recovery of the space spanned by the inliers and the supports of the outlier columns, even when the rank of inliers is high and the number of outliers is a constant...
متن کامل